The Ins and Outs of Dutch noun countability classification
Abstract
This paper presents a range of methods for classifying Dutch noun countability based on either Dutch or English data. The classification is founded on translational equivalences and on corpus analysis of linguistic features that correlate with particular countability classes. We show that crosslingual classification on the basis of word-to-word or feature-to-feature mappings between English and Dutch performs at least as well as in-language classification based on gold-standard Dutch countability data.
Related papers
Crosslingual Countability Classification with EuroWordNet
We examine the hypothesis that noun countability is consistent for a given word semantics by way of a series of experiments involving EuroWordNet and the English and Dutch languages. The basic method involves determining a default set of countabilities for each EuroWordNet synset based on countability-mapped words in that synset, and testing the match between these countabilities and those of h...
Crosslingual Countability Classification: English meets Dutch
This paper presents a range of methods for classifying Dutch nouns as countable, uncountable or plural only based on both Dutch and English data. The classification is based on the occurrence of countability-specific linguistic features that are extracted from unannotated corpora. We show that in the absence of reliable Dutch gold standard data, cross-linguistic classification can be achieved o...
Mass counts in World Englishes: A corpus linguistic study of noun countability in non-native varieties of English
Research on the morpho-syntax of non-native varieties of English has reported a widespread presence of mass noun pluralization such as baggages, equipments and softwares. In this paper we conducted a corpus linguistic study in order to provide empirically substantiated answers to this claim. We examined the purported prevalence of noun countability in World Englishes in a 1.9 billion-token mega...
A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
The present paper describes the current release of the Bochum English Countability Lexicon (BECL 2.1), a large empirical database consisting of lemmata from Open ANC (http://www.anc.org) with added senses from WordNet (Fellbaum, 1998). BECL 2.1 contains ≈ 11,800 annotated noun-sense pairs, divided into four major countability classes and 18 fine-grained subclasses. In the current version, BECL ...
Artifact Nouns: Reference and Countability
Formal semantic treatments of nominal reference have focused on natural kind nouns (Carlson 1980, Krifka et al. 1995), yet as has been widely recognized by anthropologists, cognitive scientists, philosophers, and others, artifacts (cup, furniture, chair) and natural kinds (dog, water, flower) differ critically in nature. We argue that these differences have important consequences for how artifa...
Publication date: 2003